Abstract

This paper addresses the performance of bit-interleaved coded multiple beamforming (BICMB) [1], [2] with 
imperfect knowledge of beamforming vectors. Most studies for limited-rate channel state information at the 
transmitter (CSIT) assume that the precoding matrix has an invariance property under an arbitrary unitary transform. 
In BICMB, this property does not hold. On the other hand, the optimum precoder and detector for BICMB are 
invariant under a diagonal unitary transform. In order to design a limited-rate CSIT system for BICMB, we propose 
a new distortion measure optimum under this invariance. Based on this new distortion measure, we introduce a new 
set of centroids and employ the generalized Lloyd algorithm for codebook design. We provide simulation results 
demonstrating the performance improvement achieved with the proposed distortion measure and the codebook 
design for various receivers with linear detectors. We show that although these receivers have the same performance 
for perfect CSIT, their performance varies under imperfect CSIT. 



I. Introduction 

It is well-known that multiple-input multiple-output (MIMO) systems enhance the throughput of wireless 
systems, with an increase in reliability and spectral efficiency [3], [4], [5]. While the advantages of MIMO 
architectures are attainable when only the receiver side knows the channel, the potential gains can be
further improved when the transmitter has some knowledge of the channel, which is known as channel 
state information at the transmitter (CSIT). CSIT can be used to improve diversity order or array gain of 
a MIMO wireless system. In this work, we are interested in multi-stream precoding to achieve MIMO 
spatial multiplexing. In this paper, "spatial multiplexing order" refers to the number of symbols
transmitted simultaneously, as in [6]. This term is different from "spatial multiplexing gain" defined in [7]. Throughout the
paper, we will employ the terminology single beamforming and multiple beamforming to refer to single- 
and multi-stream precoding, respectively [8], [9]. 

Precoders based on perfect CSIT are designed in [10], [11], [12] for many different design criteria. 
The majority of the designs include the channel eigenvectors which are obtained through the singular 
value decomposition (SVD) of the channel. It is well-known that it may not be practical to have perfect 
CSIT. In this paper, we will design a system with limited CSIT when the channel obeys the standard 
block fading (quasi-static) model. In this model, the channel may change from block to block, but remains
constant during the transmission of a block. This model is commonly used in the design and simulation 
of broadband wireless systems. 

Recently, limited CSIT feedback techniques have been introduced to achieve a performance close to 
the perfect CSIT case. In these, a codebook of precoding matrices is known both at the transmitter 
and receiver. The receiver selects the precoding matrix that satisfies a desired criterion, and only the 
index of the precoding matrix is sent back to the transmitter. Initial work on limited feedback systems 
concentrated on single beamforming where a single symbol is transmitted along a quantized version 
of the optimal beamforming direction. Authors of [13] analyzed single beamforming in a multi-input 
single-output (MISO) setting where they designed codebooks via the generalized Lloyd algorithm. The 
relationship between codebook design for quantized single beamforming and Grassmannian line packing 
was observed in [14], [15] for i.i.d. Rayleigh fading channels. This connection enabled the authors in 
[14], [15] to leverage the work already carried out for optimal line packing in the mathematics literature. 
Authors in [16] proposed a systematic way of designing good codebooks for single beamforming inspired 
from [17]. Rate-distortion theory tools were used in [18] to analyze single beamforming performance 
when the generalized Lloyd algorithm is used. Random vector quantization (RVQ) technique, where a 
random codebook is generated for each channel realization, was used to analyze single beamforming in an 
asymptotic scenario [19]. Later, results were generalized to multiple beamforming [20], [21]. The results in 
[20] showed that there is a relation between codebook design for multiple beamforming and Grassmannian
subspace packing. However, the results in [20] are specific to uncoded multiple beamforming. Most papers
considered the unitary or semi-unitary constraint on the precoder since the optimal linear precoder is 
unitary with perfect CSIT for linear receiver architectures [8]. In such a case, it is possible to exploit the 
properties of unitary matrices and parameterize the optimal precoder into a set of angles to be quantized 
[22], [23].

It has been shown that for a MIMO system with N transmit and M receive antennas, it is possible 
to achieve full spatial diversity of NM, independent of the number of streams 1 ≤ S ≤ min(N, M)
transmitted over quasi-static Rayleigh flat fading channels. One possible system achieving this limit is the 
so-called bit-interleaved coded multiple beamforming (BICMB) [1], [2]. Design criteria for the interleaver 
and the convolutional encoder which guarantee full diversity and full spatial multiplexing are provided 
in [1], [2]. Previously, bit-interleaved coded modulation (BICM) [24], [25] was employed in single-
and multi-antenna systems without utilizing CSIT [26], [27], [28], [29]. In general, BICMB requires 
perfect knowledge of only channel eigenvectors at the transmitter, i.e., does not need the channel gains 
(eigenvalues) at the transmitter. It has linear detection complexity and needs a simple soft-input Viterbi 
decoder. It also achieves full diversity without any adaptation for the number of streams. 

In this paper, the goal is to design a limited feedback scheme for BICMB. We first deal with codeword 
selection criterion assuming that there is already a given codebook. We provide a new optimal distortion 
measure for the selection of the best precoder from the codebook. This new distortion measure is due to 
the non-uniqueness property of the SVD [30]. We then calculate a centroid for this new distortion measure. 
We analyze the performance of the proposed distortion measure for different receiver structures through 
extensive simulations. For comparison purposes, we first use a randomly generated codebook. Next, we 
utilize the generalized Lloyd's algorithm [31] to design better codebooks. For this new codebook, we 
employ the minimum mean square error (MMSE) and the zero-forcing (ZF) receivers as well as a new 
receiver. 

Notation: N is the number of transmit antennas, M is the number of receive antennas. The symbol
S denotes the total number of symbols transmitted at a time (spatial multiplexing order, in other words
the total number of streams used). The superscripts $(\cdot)^{\dagger}$, $(\cdot)^{H}$, $(\cdot)^{T}$, $(\cdot)^{*}$, and the symbol $\forall$ denote the
pseudoinverse, Hermitian, transpose, complex conjugate, and for-all, respectively.




II. System Model 

In the limited feedback context, authors of [20] showed that, in their uncoded system, for both the 
ZF and the MMSE receiver the optimal precoder is in the form of VQ, where V is the channel right
singular matrix and Q is any unitary matrix. This characterization enabled authors to see the direct relation 
between codebook design for multiple beamforming and Grassmannian subspace packing. However, as 
we will show, in our system, multiplication of the channel right singular matrix V with a general unitary 
matrix Q, and employing VQ as the precoding matrix causes performance degradation. A new selection 
criterion and codebook design procedure is needed for limited feedback in BICMB. 

In BICMB, the output bits of a binary convolutional encoder are interleaved and then mapped over
a signal set $\chi \subseteq \mathbb{C}$ of size $|\chi| = 2^m$ with a binary labeling map $\mu : \{0,1\}^m \to \chi$. We use the same
interleaver that was previously employed for the perfect CSIT case in [2]. The interleaver is not unique
and not necessarily the optimal one, but it satisfies the design criterion and enables the system to have full
diversity when perfect CSIT is available. Gray encoding is used to map the bits onto symbols. During
transmission, the code sequence $\underline{c}$ is interleaved by $\pi$, and then mapped onto the signal sequence $\underline{x} \in \chi$.

Let H denote the quasi-static, flat fading M x N MIMO channel, where M and N are the number of
receive and transmit antennas, respectively, and assume perfect timing, synchronization and sampling. In 
this paper, we assume that the transmitter employs multiple beamforming prior to the transmission of the 
complex baseband symbols. When S symbols are transmitted at the same time, the system input-output 
relation between transmitted and received baseband complex symbols can be written as 

$$\mathbf{y} = \mathbf{H}\mathbf{V}_L\mathbf{x} + \mathbf{n} \tag{1}$$

where x is an S x 1 vector of symbols to be transmitted, n is an M x 1 additive white Gaussian noise vector
whose elements have zero mean and variance $N_0 = N/\mathrm{SNR}$, and $\mathbf{V}_L$ is an N x S precoding matrix,
which is dependent on the instantaneous channel realization. The total power transmitted is scaled as N. 
The channel matrix elements are modeled as i.i.d. zero-mean, unit-variance complex Gaussian random 
variables. Consequently, the received average signal-to-noise ratio is SNR. 

We assume that the receiver selects a precoder matrix from a finite set of beamforming matrices and 
sends the index of the selected precoder through an error-free feedback link without any delay. Precoded 
symbols are transmitted over the channel and at the receiver a linear equalizer is used as a detector prior 
to the Viterbi decoder. Our aim is to investigate the effects of imperfect CSIT on the BICMB system 
compared to the perfect CSIT scenario and therefore, we concentrate on a linear detector followed by soft
input non-iterative Viterbi decoder as in [1], [2]. In this paper we do not consider nonlinear detectors or 
iterative decoding techniques. 

The bit interleaver of BICMB can be modeled as $\pi : k' \to (k, s, i)$ where $k'$ denotes the original
ordering of the coded bits $c_{k'}$, $k$ denotes the time ordering of the signals $x_{k,s}$ transmitted, $s$ denotes the
subchannel used to transmit $x_{k,s}$, and $i$ indicates the position of the bit $c_{k'}$ on the symbol $x_{k,s}$. Let $\chi_b^i$
denote the subset of all signals $x \in \chi$ whose label has the value $b \in \{0, 1\}$ in position $i$. The bit metrics,
i.e., $\gamma^i(y_{k,s}, c_{k'})$, are dependent on the receiver structure and will be revisited in Section III-D. The Viterbi
decoder at the receiver makes decisions according to the rule

$$\hat{\underline{c}} = \arg\min_{\underline{c}} \sum_{k'} \gamma^i(y_{k,s}, c_{k'}). \tag{2}$$

III. Bit-Interleaved Coded Multiple Beamforming 
A. Background on SVD 

As stated previously, the work in this paper depends on the fact that SVD has an invariance property 
under diagonal unitary transformation. We provide a formal description of this fact below [30]. 

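Theorem 1: If $\mathbf{H} \in \mathbb{C}^{M \times N}$ has rank $k$, then it may be written in the form $\mathbf{H} = \mathbf{U}\boldsymbol{\Sigma}\mathbf{V}^H$, where U
and V are unitary matrices whose columns are the left and right singular vectors of H. The matrix
$\boldsymbol{\Sigma} = [\sigma_{ij}] \in \mathbb{R}^{M \times N}$ has $\sigma_{ij} = 0$ for all $i \neq j$, and $\sigma_{11} \geq \cdots \geq \sigma_{kk} > 0$, and $\sigma_{k+1,k+1} = \cdots = \sigma_{qq} = 0$,
where $q = \min(N, M)$. The numbers $\sigma_{ii} = \sigma_i$, $i = 1, 2, \ldots, q$ are the nonnegative square roots of the
eigenvalues of $\mathbf{H}\mathbf{H}^H$, and hence are uniquely determined. The columns of U are eigenvectors of $\mathbf{H}\mathbf{H}^H$
and the columns of V are eigenvectors of $\mathbf{H}^H\mathbf{H}$. If $N \leq M$ and if $\mathbf{H}^H\mathbf{H}$ has distinct eigenvalues, then
V is determined up to a right diagonal unitary matrix $\mathbf{D} = \mathrm{diag}(e^{j\theta_1}, e^{j\theta_2}, \ldots, e^{j\theta_N})$ with all $\theta_i \in [0, 2\pi)$;
that is, if $\mathbf{H} = \mathbf{U}_1\boldsymbol{\Sigma}\mathbf{V}_1^H = \mathbf{U}_2\boldsymbol{\Sigma}\mathbf{V}_2^H$, then $\mathbf{V}_2 = \mathbf{V}_1\mathbf{D}$.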

Proof: See [30]. ■ 

The conditions of the theorem above hold for the system in this paper, and therefore there are infinitely 
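many right singular matrices for a given channel realization. Note that when $S \leq q$ streams are transmitted,
only the first S columns of V are employed. Therefore, if $\mathbf{H} = \mathbf{U}_1\boldsymbol{\Sigma}\mathbf{V}_1^H = \mathbf{U}_2\boldsymbol{\Sigma}\mathbf{V}_2^H$, then $\mathbf{V}_2 = \mathbf{V}_1\mathbf{D}$
and $\mathbf{U}_2 = \mathbf{U}_1\mathbf{D}$ for the matrices formed by these first S columns, where D is any S x S diagonal unitary matrix.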
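
The invariance can be checked numerically. The following is a minimal sketch, assuming a 2 x 2 channel assembled from hand-picked factors; the helper names (`matmul`, `herm`, `diag`, `unitary2`) and all numeric values are illustrative assumptions, not taken from the paper. It rotates both singular matrices by the same diagonal unitary D and confirms that the channel is unchanged while V itself moves.

```python
import cmath
import math

def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def herm(A):
    """Conjugate transpose."""
    return [[A[i][j].conjugate() for i in range(len(A))]
            for j in range(len(A[0]))]

def diag(values):
    """Square diagonal matrix with the given diagonal entries."""
    n = len(values)
    return [[complex(values[i]) if i == j else 0j for j in range(n)]
            for i in range(n)]

def unitary2(alpha, beta):
    """Hypothetical 2 x 2 unitary matrix parameterized by two angles."""
    c, s = math.cos(alpha), math.sin(alpha)
    return [[complex(c), -s * cmath.exp(-1j * beta)],
            [s * cmath.exp(1j * beta), complex(c)]]

U = unitary2(0.3, 1.1)                          # left singular matrix
V = unitary2(1.2, -0.4)                         # right singular matrix
Sigma = diag([2.0, 0.5])                        # distinct singular values
D = diag([cmath.exp(0.7j), cmath.exp(-2.1j)])   # arbitrary diagonal unitary

H = matmul(matmul(U, Sigma), herm(V))
# Rotate both singular matrices by the same D: (U D) Sigma (V D)^H.
H_rotated = matmul(matmul(matmul(U, D), Sigma), herm(matmul(V, D)))

max_diff = max(abs(H[i][j] - H_rotated[i][j]) for i in range(2) for j in range(2))
VD = matmul(V, D)
shift = max(abs(V[i][j] - VD[i][j]) for i in range(2) for j in range(2))
print("channel mismatch:", max_diff)
print("largest change in V:", shift)
```

The mismatch is at rounding level because diagonal matrices commute, so D cancels around Σ, whereas V D differs from V entry by entry. This is the ambiguity that a plain Euclidean comparison against a codebook cannot see.
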

B. Selection Criteria

In this section, we assume that there exists a codebook and we wish to find a criterion to choose
the best approximation to V from the codebook $\mathcal{V} = \{\hat{\mathbf{V}}_i\}_{i=1}^{C}$, where C is the codebook size. One
could potentially use the well-known Euclidean metric; however, the property described in Theorem 1
complicates the problem.

Selection Criterion - Euclidean (SC-E): The receiver selects $\mathbf{V}_L$ such that

$$\mathbf{V}_L = \arg\min_{\hat{\mathbf{V}}_i \in \mathcal{V}} \| \mathbf{V} - \hat{\mathbf{V}}_i \|_F^2. \tag{3}$$

This selection criterion aims to find the codebook element closest to the optimal beamforming matrix V. 
It can be argued that this criterion asymptotically diagonalizes the system as the number of feedback bits 
goes to infinity. 

However, the property in Theorem 1 makes straightforward application of (3) impractical. This can
be explained with the aid of Figure 1. Assume that an application of SVD for a given instantiation of
the H matrix yields a V matrix. Assume that when V is multiplied by all diagonal unitary matrices D,
one gets the set $\mathbb{S}_V$ in Figure 1. It should be clear that the closest member of $\mathcal{V}$ to V is not necessarily
the closest member of $\mathcal{V}$ to $\mathbb{S}_V$. As a result, one needs to modify (3) such that the minimum distance
between the two sets $\mathcal{V}$ and $\mathbb{S}_V$ can be calculated. A way to accomplish this is

$$\mathbf{V}_L = \arg\min_{\hat{\mathbf{V}}_i \in \mathcal{V},\, \mathbf{D} \in \mathcal{D}} \| \mathbf{V}\mathbf{D} - \hat{\mathbf{V}}_i \|_F^2 \tag{4}$$

where $\mathcal{D}$ stands for the set of all diagonal unitary matrices.

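Proposition 1: The minimization in (4) is equivalent to the following minimization problem

$$\mathbf{V}_L = \arg\min_{\hat{\mathbf{V}}_i \in \mathcal{V}} \| \mathbf{V}\mathbf{D}^{opt} - \hat{\mathbf{V}}_i \|_F^2. \tag{5}$$

The k-th diagonal element of the diagonal matrix $\mathbf{D}^{opt}$ is $e^{j\theta_k^{opt}}$, with

$$\theta_k^{opt} = -\phi_k, \quad k = 1, 2, \ldots, S \tag{6}$$

where $0 \leq \phi_k < 2\pi$ is the phase of $\hat{\mathbf{v}}_{i,k}^H \mathbf{v}_k$ and where the vectors $\hat{\mathbf{v}}_{i,k}$ and $\mathbf{v}_k$ correspond to the k-th column
of $\hat{\mathbf{V}}_i$ and V, respectively.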

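Proof: Without loss of generality, let $N \leq M$ and S = N streams be used. For the other cases, the
matrices are replaced by their first S columns. The term to be minimized in (4) can be expressed as

$$\begin{aligned}
\| \mathbf{V}\mathbf{D} - \hat{\mathbf{V}}_i \|_F^2 &= 2\,\mathrm{tr}[\mathbf{I}] - \mathrm{tr}\left[ \hat{\mathbf{V}}_i^H \mathbf{V}\mathbf{D} + (\hat{\mathbf{V}}_i^H \mathbf{V}\mathbf{D})^H \right] \\
&= 2N - 2\,\mathrm{tr}\left[ \Re[\hat{\mathbf{V}}_i^H \mathbf{V}\mathbf{D}] \right] \qquad (7) \\
&= 2N - 2 \sum_{k=1}^{N} \Re\left[ \hat{\mathbf{v}}_{i,k}^H \mathbf{v}_k e^{j\theta_k} \right] \qquad (8)
\end{aligned}$$

where $\mathbf{D} = \mathrm{diag}(e^{j\theta_1}, e^{j\theta_2}, \ldots, e^{j\theta_N})$, and $\hat{\mathbf{v}}_{i,k}$ and $\mathbf{v}_k$ correspond to the k-th column of $\hat{\mathbf{V}}_i$ and V, respectively.
Minimizing (7) is equivalent to maximizing the summation in the second term of (8). It is easy to see that the optimal value
of $\theta_k$ maximizing the summation in (8) is

$$\theta_k^{opt} = -\phi_k, \quad k = 1, 2, \ldots, N \tag{9}$$

where $0 \leq \phi_k < 2\pi$ is the phase of $\hat{\mathbf{v}}_{i,k}^H \mathbf{v}_k = |\hat{\mathbf{v}}_{i,k}^H \mathbf{v}_k| e^{j\phi_k}$.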
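
As a numerical sanity check of Proposition 1, the sketch below compares three quantities for one pair of 2 x 2 unitary matrices: the distance evaluated at the phases from (9), the closed form $2N - 2\sum_k |\hat{\mathbf{v}}_{i,k}^H \mathbf{v}_k|$ obtained by substituting (9) into (8), and a brute-force search over a phase grid. The helper names and the two test matrices are illustrative assumptions. `cmath.phase` returns angles in $(-\pi, \pi]$ rather than $[0, 2\pi)$, which is equivalent modulo $2\pi$.

```python
import cmath
import math

def unitary2(alpha, beta):
    """Hypothetical 2 x 2 unitary matrix parameterized by two angles."""
    c, s = math.cos(alpha), math.sin(alpha)
    return [[complex(c), -s * cmath.exp(-1j * beta)],
            [s * cmath.exp(1j * beta), complex(c)]]

def column(M, k):
    return [row[k] for row in M]

def inner(a, b):
    """Hermitian inner product a^H b."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

def optimal_phases(V, V_hat):
    """theta_k = -phase(v_hat_k^H v_k), as in (9)."""
    return [-cmath.phase(inner(column(V_hat, k), column(V, k)))
            for k in range(len(V[0]))]

def dist_sq(V, V_hat, thetas):
    """||V D - V_hat||_F^2 with D = diag(exp(j * theta_k))."""
    return sum(abs(v * cmath.exp(1j * thetas[k]) - V_hat[i][k]) ** 2
               for i, row in enumerate(V) for k, v in enumerate(row))

def closed_form(V, V_hat):
    """2N - 2 * sum_k |v_hat_k^H v_k|, valid for unit-norm columns."""
    n_cols = len(V[0])
    return 2 * n_cols - 2 * sum(abs(inner(column(V_hat, k), column(V, k)))
                                for k in range(n_cols))

V = unitary2(1.2, -0.4)
V_hat = unitary2(0.9, 0.8)

d_opt = dist_sq(V, V_hat, optimal_phases(V, V_hat))
d_closed = closed_form(V, V_hat)

# Brute force over a 4-degree grid for both phases.
grid = [2 * math.pi * n / 90 for n in range(90)]
d_grid = min(dist_sq(V, V_hat, [t1, t2]) for t1 in grid for t2 in grid)

print(d_opt, d_closed, d_grid)
```

The grid minimum can only approach the analytic minimum from above, which is the behaviour the proposition predicts.
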



Proposition 1 results in the following optimal selection criterion in the Euclidean sense. 



Selection Criterion - Optimal Euclidean (SC-OE): The receiver selects $\mathbf{V}_L$ such that

$$\mathbf{V}_L = \arg\min_{\hat{\mathbf{V}}_i \in \mathcal{V}} \| \mathbf{V}\mathbf{D}^{opt} - \hat{\mathbf{V}}_i \|_F^2. \tag{10}$$

Note that, in (10), $\mathbf{D}^{opt}$ depends on both V and $\hat{\mathbf{V}}_i$. Employing (10), one can apply the well-known
generalized Lloyd algorithm [31] to design an optimum codebook $\mathcal{V}$. The resulting codebook can then be
used together with (10) as a limited-rate CSIT BICMB system. To that end, we will need centroids for
the generalized Lloyd algorithm. We will calculate these new centroids in the next subsection.
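
The difference between SC-E and SC-OE can be illustrated with a toy codebook. In this sketch (hypothetical helper names, three hand-picked 2 x 2 unitary codewords), the matrix V returned by the SVD is the second codeword multiplied by D = -I, so it describes exactly the same beamforming directions as that codeword. The aligned distance uses the closed form that follows from Proposition 1 for unit-norm columns.

```python
import cmath
import math

def unitary2(alpha, beta):
    """Hypothetical 2 x 2 unitary matrix parameterized by two angles."""
    c, s = math.cos(alpha), math.sin(alpha)
    return [[complex(c), -s * cmath.exp(-1j * beta)],
            [s * cmath.exp(1j * beta), complex(c)]]

def column(M, k):
    return [row[k] for row in M]

def inner(a, b):
    """Hermitian inner product a^H b."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

def euclid_dist_sq(V, V_hat):
    """SC-E metric: ||V - V_hat||_F^2."""
    return sum(abs(v - w) ** 2
               for rv, rw in zip(V, V_hat) for v, w in zip(rv, rw))

def aligned_dist_sq(V, V_hat):
    """SC-OE metric: min over D of ||V D - V_hat||_F^2 for unit-norm columns."""
    n_cols = len(V[0])
    return 2 * n_cols - 2 * sum(abs(inner(column(V_hat, k), column(V, k)))
                                for k in range(n_cols))

def select(V, codebook, dist):
    """Index of the codeword that minimizes the given distortion."""
    return min(range(len(codebook)), key=lambda i: dist(V, codebook[i]))

codebook = [unitary2(0.2, 0.0), unitary2(1.0, 0.5), unitary2(0.6, -1.3)]
# V equals codeword 1 times the diagonal unitary D = -I.
V = [[-x for x in row] for row in codebook[1]]

idx_e = select(V, codebook, euclid_dist_sq)
idx_oe = select(V, codebook, aligned_dist_sq)
print("SC-E picks", idx_e, "and SC-OE picks", idx_oe)
```

SC-OE recovers the matching codeword with essentially zero distortion, while SC-E sees that codeword as the farthest possible candidate and picks a different one.
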

C. Codebook Design 

Our codebook design is based on generalized Lloyd's algorithm [31]. We will minimize the average 
distortion 



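$$J = E\left[ \min_{\hat{\mathbf{V}}_i \in \mathcal{V}} \| \mathbf{V}\mathbf{D}^{opt} - \hat{\mathbf{V}}_i \|_F^2 \right]. \tag{11}$$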

Here, the distortion measure we intend to use is

$$d(\hat{\mathbf{V}}_i, \mathbf{V}) = \| \mathbf{V} - \hat{\mathbf{V}}_i \|_F^2. \tag{12}$$

But, due to the previous discussion, we need to calculate the distortion between each $\hat{\mathbf{V}}_i$ and the whole
set $\mathbb{S}_V$. As a result, we employ

$$d_1(\hat{\mathbf{V}}_i, \mathbf{V}) = \| \mathbf{V}\mathbf{D}^{opt} - \hat{\mathbf{V}}_i \|_F^2 \tag{13}$$

due to the nonuniqueness property of SVD. We assume that B bits are reserved for the limited feedback
link to quantize the optimal beamforming matrix. In this algorithm, we will begin with an initial codebook
of matrices $\mathcal{V}_0 = \{\hat{\mathbf{V}}_{0,k}\}_{k=1}^{2^B}$ and iteratively improve it to generate a set of matrices $\mathcal{V}_m = \{\hat{\mathbf{V}}_{m,k}\}_{k=1}^{2^B}$
until the algorithm converges. The algorithm can be summarized by the following steps:

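1) Generate a large training set of channel matrices H(n) and their corresponding right singular matrices
V(n). Let $\mathcal{X}$ be the set of all V(n).

2) Generate an initial codebook of unitary matrices, $\mathcal{V}_0 = \{\hat{\mathbf{V}}_{0,k}\}_{k=1}^{2^B}$.

3) Set m = 1.

4) Partition the set of training matrices into $P = 2^B$ quantization regions, where the k-th region is defined
as

$$\mathcal{X}_k = \left\{ \mathbf{V} \in \mathcal{X} : \| \mathbf{V}\mathbf{D}^{opt} - \hat{\mathbf{V}}_{m-1,k} \|_F^2 \leq \| \mathbf{V}\mathbf{D}^{opt} - \hat{\mathbf{V}}_{m-1,l} \|_F^2, \ \forall\, l \neq k \right\} \tag{14}$$

5) Using the given partitions, construct a new codebook $\mathcal{V}_m$, with the k-th beamforming matrix being

$$\hat{\mathbf{V}}_{m,k} = \arg\min_{\hat{\mathbf{V}} :\, \hat{\mathbf{V}}^H \hat{\mathbf{V}} = \mathbf{I}} E\left[ \| \mathbf{V}\mathbf{D}^{opt} - \hat{\mathbf{V}} \|_F^2 \ \middle|\ \mathbf{V} \in \mathcal{X}_k \right] \tag{15}$$

6) Define

$$J_m = \sum_{i=1}^{2^B} \sum_{n :\, \mathbf{V}(n) \to \hat{\mathbf{V}}_{m,i}} \| \mathbf{V}(n)\mathbf{D}^{opt} - \hat{\mathbf{V}}_{m,i} \|_F^2 \tag{16}$$

where $\mathbf{V}(n) \to \hat{\mathbf{V}}_{m,i}$ means $\hat{\mathbf{V}}_{m,i} = \arg\min_{\hat{\mathbf{V}}_j \in \mathcal{V}_m} d_1(\hat{\mathbf{V}}_j, \mathbf{V}(n))$. If $(J_{m-1} - J_m)/J_{m-1} > \epsilon$, set m = m + 1
and go back to Step 4. Otherwise, terminate the algorithm and set the codebook $\mathcal{V} = \mathcal{V}_m$.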
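
The steps above can be sketched for the simplest case of a single stream (S = 1), where each codeword is one unit-norm vector and the per-vector distortion reduces to $2 - 2|\hat{\mathbf{c}}^H \mathbf{v}|$. This is a simplified illustration under stated assumptions and not the paper's multi-stream design: the training vectors are drawn directly as isotropic random unit vectors instead of being computed from SVDs of channel realizations, the centroid is the principal eigenvector of the sample correlation matrix of each region (which maximizes the average of $|\mathbf{e}^H \mathbf{v}|^2$) found by plain power iteration, and all function names are hypothetical.

```python
import math
import random

def inner(a, b):
    """Hermitian inner product a^H b."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

def normalize(v):
    norm = math.sqrt(sum(abs(x) ** 2 for x in v))
    return [x / norm for x in v]

def random_unit_vector(rng, dim):
    """Isotropic unit vector from i.i.d. complex Gaussian entries."""
    return normalize([complex(rng.gauss(0, 1), rng.gauss(0, 1))
                      for _ in range(dim)])

def distortion(v, c):
    """min over theta of ||v exp(j theta) - c||^2 = 2 - 2 |c^H v|."""
    return 2.0 - 2.0 * abs(inner(c, v))

def principal_eigvec(vectors, start, iters=100):
    """Power iteration on R = sum_v v v^H, applied without forming R."""
    e = start
    for _ in range(iters):
        acc = [0j] * len(e)
        for v in vectors:
            coeff = inner(v, e)                      # v^H e
            acc = [a + coeff * x for a, x in zip(acc, v)]
        e = normalize(acc)
    return e

def lloyd(training, codebook, max_iter=20, eps=1e-4):
    """history[m] is the average distortion measured before the m-th update."""
    history = []
    for _ in range(max_iter):
        regions = [[] for _ in codebook]
        total = 0.0
        for v in training:                           # Step 4: partition
            k = min(range(len(codebook)),
                    key=lambda j: distortion(v, codebook[j]))
            regions[k].append(v)
            total += distortion(v, codebook[k])
        history.append(total / len(training))       # Step 6: distortion
        if len(history) > 1 and \
                (history[-2] - history[-1]) / history[-2] <= eps:
            break
        codebook = [principal_eigvec(r, c) if r else c   # Step 5: centroids
                    for r, c in zip(regions, codebook)]
    return codebook, history

rng = random.Random(0)
training = [random_unit_vector(rng, 2) for _ in range(300)]
final_codebook, history = lloyd(training, training[:4])  # B = 2 bits
print("first and last average distortion:", history[0], history[-1])
```

The stopping rule mirrors Step 6: iteration continues only while the relative decrease in average distortion exceeds the threshold.
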

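The optimal solution of the optimization problem in (15) gives the optimal centroid for the corresponding
region. The distortion measure to be minimized can be rewritten as

$$\begin{aligned}
\| \mathbf{V}\mathbf{D}^{opt} - \hat{\mathbf{V}} \|_F^2 &= 2N - 2\,\mathrm{tr}\left[ \Re[\hat{\mathbf{V}}^H \mathbf{V}\mathbf{D}^{opt}] \right] \\
&= 2N - 2 \sum_{s=1}^{N} \Re\left[ \hat{\mathbf{v}}_s^H \mathbf{v}_s e^{j\theta_s^{opt}} \right] \\
&= 2N - 2 \sum_{s=1}^{N} |\hat{\mathbf{v}}_s^H \mathbf{v}_s| \qquad (17)
\end{aligned}$$

where (17) follows by using the optimal $\theta_s^{opt}$ previously derived in (9). Therefore the original optimization
problem in (15) can be rewritten as

$$\hat{\mathbf{V}}_{m,k} = \arg\max_{\hat{\mathbf{V}} :\, \hat{\mathbf{V}}^H \hat{\mathbf{V}} = \mathbf{I}} E\left[ \sum_{s=1}^{N} |\hat{\mathbf{v}}_s^H \mathbf{v}_s| \ \middle|\ \mathbf{V} \in \mathcal{X}_k \right]. \tag{18}$$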

The maximization problem above does not have a tractable analytical solution. Next, we will modify
the problem to find an approximate analytical solution. Note that the expectation in (18) can be written
as the sum of the expectations of each term due to the linearity of the expectation operation. We will relax
the unitary constraint on $\hat{\mathbf{V}}$ and replace the constraint with having unit norm columns. In this case, the
modified optimization problem is equivalent to finding independent optimal vectors which maximize each
expectation in (18). The individual maximization problem becomes

$$\hat{\mathbf{e}}_{m,k}^{(i)} = \arg\max_{\mathbf{e} :\, \|\mathbf{e}\|_2 = 1} E\left[ |\mathbf{e}^H \mathbf{v}_i| \ \middle|\ \mathbf{v}_i \in \mathcal{X}_k^{(i)} \right], \quad i = 1, 2, \ldots, N \tag{19}$$

where $\mathcal{X}_k^{(i)}$ corresponds to the space of the i-th column of the elements in $\mathcal{X}_k$. The optimal solution for
(19) is [32]

$$\hat{\mathbf{e}}_{m,k}^{(i)} = \text{principal eigenvector of } E\left[ \mathbf{v}_i \mathbf{v}_i^H \ \middle|\ \mathbf{v}_i \in \mathcal{X}_k^{(i)} \right] \tag{20}$$

where the numerical averaging over $\mathcal{X}_k^{(i)}$ is substituted for expectation during codebook design. Let $\mathbf{E}_{m,k}$
be the matrix whose columns are found from (20), maximizing the expectation in (19) and approximating
the maximization in (18). Note that this matrix is not necessarily unitary; therefore, to find the centroid
we will utilize Euclidean projection to find the closest unitary matrix as follows



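$$\hat{\mathbf{V}}_{m,k} = \arg\min_{\hat{\mathbf{V}} :\, \hat{\mathbf{V}}^H \hat{\mathbf{V}} = \mathbf{I}} \| \mathbf{E}_{m,k} - \hat{\mathbf{V}} \|_F^2. \tag{21}$$

The closest unitary matrix can be found in closed form as [30]

$$\hat{\mathbf{V}}_{m,k} = \bar{\mathbf{U}}\bar{\mathbf{W}}^H \tag{22}$$

where $\mathbf{E}_{m,k} = \bar{\mathbf{U}}\bar{\boldsymbol{\Sigma}}\bar{\mathbf{W}}^H$ is the SVD of $\mathbf{E}_{m,k}$.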

The approach explained above to find the centroid in each region reduces to the optimal solution for the 
single beamforming case. Although it may be suboptimal for the multiple beamforming case, the centroid 
found from (22) enables the algorithm to have monotonic decrease in average distortion given by (11)
in each iteration and to converge to a local minimum, as shown in Figure 2 for a 2 x 2 scenario with 2
streams and 4-bit feedback. 

D. The Receiver 

We will first discuss the ZF and MMSE receivers and then describe a receiver based on SVD. We show
in the Appendix that the performance of these three receivers is the same when there is perfect CSIT.

1) ZF Receiver 

When there is only limited feedback for the quantization of V, i.e., V L is used as the precoder, the 
diagonalization of the channel will be lost and with the ZF detector, the system input-output relation 
becomes

$$\mathbf{r} = \mathbf{G}\mathbf{y} = \mathbf{x} + \mathbf{G}\mathbf{n} \tag{23}$$

where $\mathbf{G} = (\mathbf{H}\mathbf{V}_L)^{\dagger} = [(\mathbf{H}\mathbf{V}_L)^H (\mathbf{H}\mathbf{V}_L)]^{-1} (\mathbf{H}\mathbf{V}_L)^H$. In this case, we will use the following bit metrics
[33],

$$\gamma^i(r_{k,s}, c_{k'}) = \min_{x \in \chi^i_{c_{k'}}} \frac{|r_{k,s} - x|^2}{\|\mathbf{g}_s\|^2} \tag{24}$$

where $r_{k,s}$ is the received signal after equalization at time k on the s-th stream and $\mathbf{g}_s$ is the s-th column
of $\mathbf{G}^T$.

In the perfect CSIT case, where the channel right singular matrix V is perfectly known at the transmitter,
the bit metrics (24) of the ZF receiver are equal to those of the optimum BICMB receiver. The proof is
provided in the Appendix. 

2) MMSE Receiver

The MMSE detector is a superior solution to the linear detection problem, which balances ISI against noise
enhancement. The corresponding input-output relation is given by (23), where now G is given by

$$\mathbf{G} = [(\mathbf{H}\mathbf{V}_L)^H \mathbf{H}\mathbf{V}_L + \sigma^2 \mathbf{I}]^{-1} (\mathbf{H}\mathbf{V}_L)^H \tag{25}$$

and where $\sigma^2 = N/\mathrm{SNR}$ from the system model given in Section II. We will use the following bit metrics
[34]

$$\gamma^i(r_{k,s}, c_{k'}) = \min_{x \in \chi^i_{c_{k'}}} \frac{W_{ss}}{1 - W_{ss}} \left| \frac{r_{k,s}}{W_{ss}} - x \right|^2 \tag{26}$$

where $\mathbf{W} = [\mathbf{I} + \sigma^2 [(\mathbf{H}\mathbf{V}_L)^H \mathbf{H}\mathbf{V}_L]^{-1}]^{-1}$ and $W_{ss}$ is the s-th diagonal element of W.

In the perfect CSIT case, the MMSE receiver is equivalent to the optimum BICMB receiver. The proof 
is provided in the Appendix. 



3) SVD Receiver 

In the case of perfect knowledge of V at the transmitter, the receiver can use the $\mathbf{U}^H$ matrix to
diagonalize the channel, where $\mathbf{H} = \mathbf{U}\boldsymbol{\Sigma}\mathbf{V}^H$. In the case of limited feedback, the $\mathbf{U}^H$ matrix can still be
used as an equalizer [22], [35]. In this section, we will provide a linear detector which performs the same
as the $\mathbf{U}^H$ detector with lower complexity. Note that we proposed an optimum selection criterion in (10),
which is needed because of the nonuniqueness property of SVD. The optimized selection criterion aims
to quantize $\mathbf{V}\mathbf{D}^{opt}$ instead of V. Each element of the diagonal unitary matrix $\mathbf{D}^{opt}$ can be found from (6)
and it is dependent on the codebook elements and the instantaneous channel realization. From Theorem
1, it is easy to see that there is a unique matching left singular matrix for $\mathbf{V}\mathbf{D}^{opt}$, which can be used as
a detector. Therefore the corresponding linear equalizer matrix is

$$\mathbf{G} = (\mathbf{U}\mathbf{D}^{opt})^H. \tag{27}$$

In this case, when $\mathbf{V}_L$ is used as a precoder at the transmitter, the baseband system input-output relation
is

$$\begin{aligned}
\mathbf{r} &= \mathbf{G}\mathbf{H}\mathbf{V}_L\mathbf{x} + \mathbf{G}\mathbf{n} \qquad (28) \\
&= (\mathbf{D}^{opt})^H \boldsymbol{\Sigma}\mathbf{V}^H \mathbf{V}_L \mathbf{x} + \mathbf{n}' \qquad (29)
\end{aligned}$$
where in (29) H is replaced by its SVD. Note that because G is a unitary transformation, the noise vectors
$\mathbf{n}'$ and $\mathbf{n}$ have the same statistics. Then the input-output relation for the s-th stream becomes

$$\begin{aligned}
r_s &= \lambda_s e^{-j\theta_s^{opt}} \sum_{i=1}^{S} \mathbf{v}_s^H \mathbf{v}_{L,i}\, x_i + n'_s \qquad (30) \\
&= \lambda_s e^{-j\theta_s^{opt}} \mathbf{v}_s^H \mathbf{v}_{L,s}\, x_s + \lambda_s e^{-j\theta_s^{opt}} \sum_{i=1,\, i \neq s}^{S} \mathbf{v}_s^H \mathbf{v}_{L,i}\, x_i + n'_s \qquad (31) \\
&= \lambda_s |\mathbf{v}_s^H \mathbf{v}_{L,s}|\, x_s + \lambda_s e^{-j\theta_s^{opt}} \sum_{i=1,\, i \neq s}^{S} \mathbf{v}_s^H \mathbf{v}_{L,i}\, x_i + n'_s. \qquad (32)
\end{aligned}$$

Note that the first term has the desired signal, the second term is interference from other streams, and the
third term is noise. The transmitted symbols $x_i$ are typically from symmetric constellations. Therefore,
the mean of $x_i$ is zero. As discussed previously, we normalize its variance to 1. Due to bit interleaving,
$x_i$, $i = 1, 2, \ldots, S$ are uncorrelated. For a given channel realization, (32) can be written in a compact
form as

$$r_s = \tilde{\lambda}_s x_s + \tilde{n}_s \tag{33}$$

where $\tilde{\lambda}_s = \lambda_s |\mathbf{v}_s^H \mathbf{v}_{L,s}|$ and $\tilde{n}_s$ is approximated as a zero-mean complex Gaussian random variable with
variance $\tilde{\sigma}_s^2 = \lambda_s^2 \sum_{i=1,\, i \neq s}^{S} |\mathbf{v}_s^H \mathbf{v}_{L,i}|^2 + N/\mathrm{SNR}$. We determined through simulations that the Gaussian
approximation is highly accurate for low and intermediate SNR values (e.g., 15 dB) or when the number of 
feedback bits is beyond 4. Although for large SNR (e.g., 30 dB), the approximation is less accurate, as the 
feedback rate increases, the accuracy loss diminishes independent of SNR. In addition, this approximation 
enables a very simple bit-metric calculation similar to the perfect CSIT case. 
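
The quantities in (33) are simple to compute once V, the selected codeword, and the singular values are known. The sketch below is a hedged illustration: `svd_receiver_stats` and `unitary2` are hypothetical names, and the singular values, the matrices, and the 15 dB SNR are arbitrary example inputs. For each stream it returns the effective gain, the interference-plus-noise variance, and the phase-aligned desired-signal coefficient, which should come out real and positive.

```python
import cmath
import math

def unitary2(alpha, beta):
    """Hypothetical 2 x 2 unitary matrix parameterized by two angles."""
    c, s = math.cos(alpha), math.sin(alpha)
    return [[complex(c), -s * cmath.exp(-1j * beta)],
            [s * cmath.exp(1j * beta), complex(c)]]

def column(M, k):
    return [row[k] for row in M]

def inner(a, b):
    """Hermitian inner product a^H b."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

def svd_receiver_stats(lams, V, V_L, snr_linear):
    """Per-stream (gain, variance, aligned coefficient) for the model in (33)."""
    n_tx, n_streams = len(V), len(V[0])
    stats = []
    for s in range(n_streams):
        v_s = column(V, s)
        # Optimal phase: minus the phase of v_{L,s}^H v_s.
        theta = -cmath.phase(inner(column(V_L, s), v_s))
        coeff = cmath.exp(-1j * theta) * lams[s] * inner(v_s, column(V_L, s))
        gain = lams[s] * abs(inner(v_s, column(V_L, s)))
        leak = sum(abs(inner(v_s, column(V_L, i))) ** 2
                   for i in range(n_streams) if i != s)
        variance = lams[s] ** 2 * leak + n_tx / snr_linear
        stats.append((gain, variance, coeff))
    return stats

lams = [2.0, 0.5]                 # example singular values
snr = 10 ** (15 / 10)             # 15 dB in linear scale
V = unitary2(1.2, -0.4)           # example right singular matrix
V_L = unitary2(1.0, -0.1)         # example quantized precoder

quantized = svd_receiver_stats(lams, V, V_L, snr)
perfect = svd_receiver_stats(lams, V, V, snr)   # precoder equals V
for s, (gain, variance, _) in enumerate(quantized):
    print("stream", s, "gain", gain, "variance", variance)
```

With the precoder equal to V, the gains reduce to the singular values and the variance to N/SNR, which matches the perfect CSIT model; any quantization error lowers the gain and adds interference power.
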

Let $\chi_b^i$ denote the subset of all signals $x \in \chi$ whose label has the value $b \in \{0, 1\}$ in position $i$. The
bit metrics for (33) are given by [24]

$$\gamma^i(r_{k,s}, c_{k'}) = \min_{x \in \chi^i_{c_{k'}}} \frac{|r_{k,s} - \tilde{\lambda}_s x|^2}{\tilde{\sigma}_s^2}. \tag{34}$$

In the sequel, we will call the receiver proposed in this section the SVD receiver. We will show
in the next section that the performance of the SVD receiver is close to that of the MMSE receiver for
the 2 x 2 MIMO system. The advantage of the SVD receiver over the MMSE receiver is its relative
simplicity since it avoids the matrix inversions needed in (23) and (26). One can observe that when the
limited feedback rate is low, the interference term may dominate the noise term, which may result in
poor performance. We emphasize that the optimum receiver with a linear detector for the limited-rate
CSIT system described in the previous section is the MMSE receiver. However, the SVD receiver is a 
simpler one with a performance tradeoff against the MMSE receiver while consistently outperforming the 
ZF receiver. 

IV. Simulation Results 

In the simulations below, the industry standard 64-state 1/2-rate (133,171) $d_{free} = 10$ convolutional
code is used and the constellation is 16-QAM. As in all similar work, the channel is assumed to be 
quasi-static and flat fading. 

Figure 3 illustrates that in the case of BICMB, the precoder matrix V is not invariant under a
general unitary matrix transformation. As discussed previously, assumption of this invariance results in
the Grassmannian codebook design approach studied widely in the literature [20]. Again, as discussed
previously, most of the work in the literature is for uncoded systems where invariance under a general
unitary matrix transformation follows from the use of an optimization criterion such as MSE, SNR, or mutual
information. All curves in this figure employ BICMB with the ZF receiver. The solid curves employ the
V matrix given by the SVD of H, while those with broken lines employ the precoder VQ, where Q is a 2 x 2 DFT
matrix, which is unitary. Clearly, BICMB performance is not invariant under a general unitary matrix
transformation.

Figure 4 shows a number of different systems to illustrate the improvement due to the new selection
criterion (10). This selection criterion is compared to the one that maximizes the minimum eigenvalue
($\lambda_{\min}$) of $\mathbf{H}\hat{\mathbf{V}}_i$. This method is employed in [20] with the ZF receiver. In order to show that there is
a gain due to (10), we use the ZF receiver in our system as well. Systematic generation of codebooks
[17] with a selection criterion that maximizes $\lambda_{\min}$ is used for the curves with legend SC-$\lambda_{\min}$, and the
randomly generated codebook with the optimal Euclidean selection criterion of Proposition 1 is used for the curves with legend
SC-OE. As can be seen, the performance is improved significantly with the proposed approach, and with the
$\lambda_{\min}$ approach, the performance saturates as the number of bits increases.

Figure 5 compares (3) and (10) employing two receiver structures: ZF and MMSE. The codebook
employed is randomly generated. There is clearly a significant gain due to (10) for both receivers. In
Figure 6, the performance of the SVD receiver is compared with the ZF and MMSE receivers for the
2 x 2 scenario with 2 streams. All curves in the figures use the optimized Euclidean criterion with a
randomly generated codebook. The SVD receiver, which exploits the nonuniqueness of SVD both at the
transmitter and the receiver, significantly outperforms the ZF receiver and achieves a performance very
close to the MMSE receiver for the 8-bit scenario. Note that the overall complexity of the system with
the SVD receiver is less than the one with the MMSE receiver. When the number of feedback bits is 8
for the 2 x 2 case, it achieves a performance within 0.25 dB of the unquantized system.

Figure 7 shows the simulation results for various receivers in a 2 x 2 system when the codebook is
designed using the VQ algorithm discussed in Section III-C. All curves use the optimal Euclidean criterion.
As seen from the figure, the performance of the randomly generated codebook (RVQ) can be significantly 
improved for all receivers. To illustrate, the performance of MMSE 8-bit RVQ and 6-bit VQ are very close 
to each other, therefore 2 bit reduction is achieved via the proposed codebook design. A similar reduction 
can be observed for the SVD receiver. On the other hand, for the same number of feedback bits, 2 dB 
performance gain is achievable for the ZF receiver. Note that there is significant performance degradation 
when the ZF receiver is used for both RVQ and VQ scenarios compared to the MMSE and SVD receivers. 

V. Conclusion 

BICMB is a high-performance and low-complexity broadband wireless system with full spatial multiplexing
and full diversity. However, the system requires perfect knowledge of the channel right singular
vectors, which is not practical in a real environment. 

This paper addressed the performance of BICMB with limited CSIT feedback using a codebook-based 
approach. We proposed a new optimal distortion measure for selecting the best precoder from a given 
codebook. The centroids for this distortion measure are calculated. Codebook design is performed via the 
generalized Lloyd algorithm based on the new distortion measure and the new centroids. We provided 
simulation results demonstrating the performance improvement achieved with the proposed distortion 
measure for various receivers with linear detectors. 

Appendix 

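In the perfect CSIT case, the transmitter uses the right singular matrix V as the precoding matrix $\mathbf{V}_L$.
The N x S precoding matrix can be expressed as $\mathbf{V}_L = \mathbf{V}\boldsymbol{\Phi}_N$, where the N x S matrix $\boldsymbol{\Phi}_N$ is used to
select the first S columns of V, defined as

$$\boldsymbol{\Phi}_N = \begin{bmatrix} \mathbf{I}_S \\ \mathbf{0}_{N-S,S} \end{bmatrix}$$

and $\mathbf{0}_{N-S,S}$ is an (N - S) x S matrix whose elements are all zeros. Therefore, the system input-output
relation in (1) can be written as

$$\mathbf{y} = \mathbf{U}\tilde{\boldsymbol{\Sigma}}\mathbf{x} + \mathbf{n}, \tag{35}$$

where $\tilde{\boldsymbol{\Sigma}}$ is defined as

$$\tilde{\boldsymbol{\Sigma}} = \boldsymbol{\Sigma}\boldsymbol{\Phi}_N = \begin{bmatrix} \boldsymbol{\Sigma}_S \\ \mathbf{0}_{M-S,S} \end{bmatrix}$$

and $\boldsymbol{\Sigma}_S$ is an S x S diagonal matrix whose diagonal elements are the largest S singular values of H.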

1) BICMB Receiver 

The optimum detector for the BICMB receiver is the corresponding left singular matrix $\mathbf{U}^H$. The
baseband input-output relation for each subchannel becomes [2]

$$r'_{k,s} = \lambda_s x_{k,s} + n_{k,s} \tag{36}$$

for $s = 1, 2, \ldots, S$ where $\lambda_s$ is the s-th channel singular value and $r'_{k,s}$ is the detected symbol of the s-th
subchannel at the k-th time instant which is defined as in (2). Then, the following ML bit metrics for the
BICMB soft input Viterbi decoder are used [1], [2]

$$\gamma^i_{BICMB}(r'_{k,s}, c_{k'}) = \min_{x \in \chi^i_{c_{k'}}} |r'_{k,s} - \lambda_s x|^2 \tag{37}$$

where $k'$ is defined as in (2).

2) ZF Receiver 

After the ZF detector, the system input-output relation becomes 

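$$\mathbf{r}^{ZF} = \tilde{\boldsymbol{\Sigma}}^{\dagger}\mathbf{U}^H \mathbf{y} = \mathbf{x} + \tilde{\boldsymbol{\Sigma}}^{\dagger}\mathbf{U}^H \mathbf{n} \tag{38}$$

where G in (23) is replaced by $(\mathbf{H}\mathbf{V}_L)^{\dagger} = (\mathbf{U}\boldsymbol{\Sigma}\mathbf{V}^H \mathbf{V}\boldsymbol{\Phi}_N)^{\dagger} = \tilde{\boldsymbol{\Sigma}}^{\dagger}\mathbf{U}^H$. Note that the last equality $(\mathbf{A}\mathbf{B})^{\dagger} =
\mathbf{B}^{\dagger}\mathbf{A}^{\dagger}$ holds if $\mathbf{A}^H \mathbf{A} = \mathbf{I}$ [36]. Accordingly, the baseband input-output relation for each substream becomes

$$r^{ZF}_{k,s} = x_{k,s} + \lambda_s^{-1} n_{k,s} = \lambda_s^{-1} r'_{k,s}, \tag{39}$$

where the relation with $r'_{k,s}$ is obvious when (39) is compared with (36).

To calculate the s-th column of $\mathbf{G}^T$ for metric calculation in (24), consider $\mathbf{U} = (\mathbf{u}_1 : \mathbf{u}_2 : \ldots : \mathbf{u}_M)$, where
$\mathbf{u}_1, \mathbf{u}_2, \ldots, \mathbf{u}_M$ are the column vectors of U. Then,

$$\mathbf{G}^T = (\tilde{\boldsymbol{\Sigma}}^{\dagger}\mathbf{U}^H)^T = (\mathbf{u}_1^* : \mathbf{u}_2^* : \ldots : \mathbf{u}_M^*) \begin{bmatrix} \boldsymbol{\Sigma}_S^{-1} \\ \mathbf{0}_{M-S,S} \end{bmatrix}. \tag{40}$$

Therefore, the s-th column of $\mathbf{G}^T$ in (24) is equal to $\lambda_s^{-1}\mathbf{u}_s^*$, leading to $\|\mathbf{g}_s\|^2 = 1/\lambda_s^2$. By replacing $\|\mathbf{g}_s\|^2$
and $r_{k,s}$ in (24) with $1/\lambda_s^2$ and $\lambda_s^{-1} r'_{k,s}$, respectively, the bit metrics for the ZF decoder become

$$\gamma^i_{ZF}(r_{k,s}, c_{k'}) = \min_{x \in \chi^i_{c_{k'}}} |r'_{k,s} - \lambda_s x|^2 \tag{41}$$

which are equal to the bit metrics of BICMB in (37).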



3) MMSE Receiver 

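The MMSE detector G in (25) with perfect CSIT becomes

$$\begin{aligned}
\mathbf{G} &= [(\mathbf{H}\mathbf{V}\boldsymbol{\Phi}_N)^H \mathbf{H}\mathbf{V}\boldsymbol{\Phi}_N + \sigma^2 \mathbf{I}]^{-1} (\mathbf{H}\mathbf{V}\boldsymbol{\Phi}_N)^H \\
&= [\boldsymbol{\Sigma}_S^2 + \sigma^2 \mathbf{I}]^{-1} \tilde{\boldsymbol{\Sigma}}^H \mathbf{U}^H. \qquad (42)
\end{aligned}$$

If we define $\boldsymbol{\Omega}$ as

$$\boldsymbol{\Omega} = \boldsymbol{\Sigma}_S^2 + \sigma^2 \mathbf{I} \tag{43}$$

then $\boldsymbol{\Omega}$ is an S x S diagonal matrix whose s-th diagonal element can be expressed as $\mu_s = \lambda_s^2 + \sigma^2$. The
baseband signal after the MMSE detector given in (23) is

$$\mathbf{r}^{MMSE} = \boldsymbol{\Omega}^{-1}\tilde{\boldsymbol{\Sigma}}^H \mathbf{U}^H \mathbf{y} = \boldsymbol{\Omega}^{-1}\boldsymbol{\Sigma}_S^2 \mathbf{x} + \boldsymbol{\Omega}^{-1}\tilde{\boldsymbol{\Sigma}}^H \mathbf{U}^H \mathbf{n} \tag{44}$$

where G is replaced by the shortened form of (42) and (43). Since $\boldsymbol{\Omega}^{-1}$, $\tilde{\boldsymbol{\Sigma}}$ and $\boldsymbol{\Sigma}_S^2$ are all diagonal
matrices, the baseband vector signal can be separated into each subchannel signal, resulting in the following
relation with $r'_{k,s}$ of (36) as

$$r^{MMSE}_{k,s} = \frac{\lambda_s^2}{\mu_s} x_{k,s} + \frac{\lambda_s}{\mu_s} n_{k,s} = \frac{\lambda_s}{\mu_s} r'_{k,s}. \tag{45}$$

The bit metrics in (26) require the calculation of a matrix W. Using an analysis similar to the MMSE
detector, W can be expressed as

$$\mathbf{W} = [\mathbf{I} + \sigma^2 (\boldsymbol{\Sigma}_S^2)^{-1}]^{-1}. \tag{46}$$

By multiplying both sides of (43) with $(\boldsymbol{\Sigma}_S^2)^{-1}$, we get

$$\mathbf{I} + \sigma^2 (\boldsymbol{\Sigma}_S^2)^{-1} = \boldsymbol{\Omega}(\boldsymbol{\Sigma}_S^2)^{-1}. \tag{47}$$

Using (46) and (47), the s-th diagonal element $W_{ss}$ of W can be easily found as $W_{ss} = \lambda_s^2/\mu_s$. Finally,
with the help of the simplified $W_{ss}$ and the relation with $r'_{k,s}$ of (45), the bit metrics in (26) become

$$\gamma^i_{MMSE}(r_{k,s}, c_{k'}) = \min_{x \in \chi^i_{c_{k'}}} \frac{1}{\sigma^2} |r'_{k,s} - \lambda_s x|^2 \tag{48}$$

which are equivalent to the bit metrics of BICMB in (37) because the constant $1/\sigma^2$ can be ignored in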
the metric calculation. 
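
The equivalences derived in this Appendix can be confirmed numerically for a single subchannel. The following sketch uses hypothetical function names and arbitrary example values for the singular value, the noise variance, the detected symbol, and the candidate symbol subset. It evaluates the BICMB metric (37), the ZF metric (24) with the substitutions from (39), and the MMSE metric (26) with the substitutions from (45) and $W_{ss} = \lambda_s^2/\mu_s$.

```python
import math

def bicmb_metric(r_prime, lam, points):
    """Metric (37): min over candidates of |r' - lambda x|^2."""
    return min(abs(r_prime - lam * x) ** 2 for x in points)

def zf_metric(r_prime, lam, points):
    """Metric (24) with r = r'/lambda and ||g_s||^2 = 1/lambda^2."""
    r = r_prime / lam
    g_norm_sq = 1.0 / lam ** 2
    return min(abs(r - x) ** 2 / g_norm_sq for x in points)

def mmse_metric(r_prime, lam, sigma2, points):
    """Metric (26) with r = (lambda/mu) r' and W_ss = lambda^2/mu."""
    mu = lam ** 2 + sigma2
    r = lam / mu * r_prime
    w = lam ** 2 / mu
    return min(w / (1.0 - w) * abs(r / w - x) ** 2 for x in points)

lam = 1.7                        # example singular value
sigma2 = 0.2                     # example noise variance N/SNR
r_prime = complex(0.9, -0.4)     # example detected symbol after U^H
scale = 1.0 / math.sqrt(2.0)
# Example subset of unit-energy QPSK points sharing one label bit.
points = [complex(scale, scale), complex(scale, -scale)]

m_bicmb = bicmb_metric(r_prime, lam, points)
m_zf = zf_metric(r_prime, lam, points)
m_mmse = mmse_metric(r_prime, lam, sigma2, points)
print(m_bicmb, m_zf, sigma2 * m_mmse)
```

The ZF metric coincides with the BICMB metric, and the MMSE metric differs only by the factor $1/\sigma^2$, which is common to all candidates and therefore does not affect the Viterbi decisions.
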

Fig. 2. Average distortion for 2 x 2 system with 4-bit feedback.

Fig. 4. New selection criterion vs. $\lambda_{\min}$-based selection criterion, 3x2 system with 2 streams.

Fig. 6. The SVD receiver vs. MMSE and ZF receivers using SC-OE with randomly generated codebook, 2x2 system with 2 streams.

Fig. 7. The SVD receiver vs. MMSE and ZF receivers with RVQ and VQ using SC-OE, 2x2 system with 2 streams.



